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Structural studies of biological macromolecules 
are severely limited by radiation damage. Tradi- 
tional crystallography curbs the effects of dam- 
age by spreading damage over many copies of 
the molecule of interest. X-ray lasers, such as 
the recently built LINAC Coherent Light Source 
(LCLS) [Ij, offer an additional opportunity for 
limiting damage by out-running damage pro- 
cesses with ultrashort and very intense X-ray 
pulses. Such pulses may allow the imaging of sin- 
gle molecules, clusters or nanoparticles, but co- 
herent flash imaging will also open up new av- 
enues for structural studies on nano- and micro- 
crystalline substances. This paper addresses the 
theoretical potentials and limitations of nanocrys- 
tallography with extremely intense coherent X- 
ray pulses. We use urea nanocrystals as a model 
for generic biological substances and simulate pri- 
mary and secondary ionization dynamics in the 
crystalline sample. Our results establish condi- 
tions for ultrafast nanocrystallography diffraction 
experiments as a function of fluence and pulse du- 
ration. 

Any sample exposed to an intense X-ray pulse will 
be ionized and extensive ionization destroys the sample. 
The time scale on which this process occurs is critical 
for obtaining an interpretable diffraction pattern that 
conveys an atomic structure of the sample. In princi- 
ple, the X-ray pulse must be short enough for the en- 
tire pulse to pass through the sample before major disar- 
rangement of atomic and electronic configurations takes 
place. The ionizations due to direct photoabsorption and 
subsequent secondary processes affect the ability to get 
useful structural information from the diffraction pat- 
tern in three ways: (i) Ionization decreases the elastic 
X-ray scattering power of atoms and induces consid- 
erable changes in diffracted intensities due to ionization 
stochasticity. (ii) Removal of electrons from atoms leaves 
behind positively charged ions that repel each other due 
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FIG. 1: Comparison of crystal size and the modeled 
size of secondary electron cascades, (a) The unit cell of 
a urea crystal contains light elements abundant in proteins: 
carbon (depicted as green), nitrogen (blue), oxygen (red) and 
hydrogen (white), (b) A urea nanocrystal of 200 nm would 
contain about 50 million unit cells. A protein nanocrystal 
of similar size would contain about 20,000 unit cells (using 
lysozyme as an example), (c) The overall dimensions of sim- 
ulated electron clouds produced during the thermalization of 
a single 0.4 keV Auger electron ejected from a nitrogen atom 
(top) and a single 8 keV photoelectron (bottom) inside a large 
urea crystal after 50 femtoseconds. Similar cascade sizes are 
produced in protein crystals, during an X-ray diffraction ex- 
periment. The total number of ionizations was 18 in the Auger 
cascade, and 390 in the photoelectron cascade at 50 fs after 
the emission of primary electrons. At this point, the radius of 
gyration of the photoelectron cloud reached 300 nm, and that 
of the Auger electron cloud 8 nm. The photoelectron cascade 
is bigger than a typical nanocrystal under consideration here. 



to Coulomb forces, leading to the destruction of the struc- 
ture, (iii) Free electrons either leave the sample, if their 
energy is high enough, or remain in the sample as a back- 
ground electron gas, in which case they will be a source 
of noise in the diffraction image. 
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FIG. 2: Evolution of secondary ionization cascades in a urea crystal over time, (a) Number of secondary ionizations 
produced by a photoelectron of 8 keV and by Auger electrons (impact energies: 250 eV for carbon, 400 eV for nitrogen, 
500 eV for oxygen), (a) Spatial evolution of the electron cloud from a single photoelectron of 8 keV energy in an infinitely 
large urea crystal depicted through the radial electron density as a function of time and radial distance from the point of 
incidence. The cascade includes the primary electron and its secondary electrons. At any given time, the 47r integration over 
the radius in the 3D volume gives the total number of electrons, assuming spherical symmetry when the cascades are added 
stochastically, (c) Secondary electron cloud from a single Auger electron (nitrogen). The cascade includes the primary electron 
and its secondary electrons. The thermalization of electrons from oxygen and carbon has similar features. Black lines in (b) 
and (c) show the radii of gyration (Equation [T]) of the electron clouds. In the first femtosecond the electron clouds are highly 
anisotropic. After 20 fs no more secondary ionizations will occur in the photoelectron cascade (5 fs for Auger cascades). Figures 
show averaging over 1000 simulations on an infinitely large urea crystal. 



Thermalization of trapped electrons leads to additional 
ionizations through cascade processes. The probability 
of trapping, and the size of the resulting secondary elec- 
tron cascades, depends on sample size and X-ray energy 
(among others). Photoelectrons released by X-rays of 
1.5 A wavelength are fast (53 nm/fs) and can escape from 
small samples such as "nanosized" crystals [H |3] early 
in an exposure (Figure [T]). In contrast. Auger electrons 
from carbon atoms are slow (9.5 nm/fs) and cause sec- 
ondary ionization even in a single protein molecule [H |5] . 
For small samples (diameter < 10 nm), the explosion is 
dominated by the repulsion of positive ions left behind 
by electrons leaving the sample [4( l6l. In big samples 
(diameter>500 nm), most electrons will be trapped sim- 
ply because they lose energy before reaching the surface. 
Trapped electrons increase the temperature of the sample 
through collisional processes, while slowing the explosion 
by partially screening the positive charges and creating a 
net neutral core. Predictions point to a transition from 
Coulomb explosion to a hydrodynamic expansion with in- 
creasing sample size. A positively charged surface layer is 
formed, destroying the sample from outside towards the 
center. The expansion is driven by thermal processes as 
the electron pressure grows [7 . Hence, crystalline diffrac- 
tion and useful structural information might still be ob- 
tained from the center of the crystal. 

Descriptions of electron impact ionization and sec- 
ondary ionization cascades exist for different materi- 
als [5l ISl [9] . Dynamics of photoelectrons in protein crys- 
tals have been investigated earlier ^ , without consid- 
eration to Auger emission or secondary ionization cas- 
cades. These predictions suggest that radiation damage 
can be limited by reducing the crystal size. The present 
work steps beyond these studies and gives an integrated 



description of photo-emission. Auger emission and cas- 
cade processes during exposure of a biological nanocrys- 
tal to an ultrashort and intense X-ray pulse, to determine 
the feasibility of nanocrystal imaging and improvement 
in resolution achievable with shorter pulses. Our find- 
ings are summarized in Figures [l]|4] and the methodology 
is described in Methods and Supplementary. 

For light elements, a single photo-ionization releases 
electrons at two distinctly different energies (Figure [T]) . 
The photoelectron energy corresponds to the difference 
between the photon energy and the K-shell binding en- 
ergy, while Auger electrons carry kinetic energy depen- 
dent on atom type (250 eV for carbon). The average 
time for the first collisional ionization scales with the 
primary electron energy (Figure [2|i). The electron cloud 
initiated by a photoelectron thermalizes slower than elec- 
trons in Auger cascades, since energetic electrons travel 
further between scattering events in the crystal due to 
their lower interaction cross section (Figures [2]3,c). At 
the same time, the cloud generated by a photoelectron is 
around four orders of magnitude larger in volume than 
the Auger electron induced cloud. After thermalization, 
the electron clouds keep expanding through diffusion, fol- 
lowing a random walk pattern. Figures [2]3,c show that 
the radius of gyration (Equation [T]) at these impact en- 
ergies describes well the spatial extent of the electron 
clouds. 

In a sample that is small compared to the size of the 
X-ray beam or the photon absorption length, photoion- 
ization events will occur with equal probability through- 
out the entire sample. At 8.3 keV photon energy, a sin- 
gle photoelectron will liberate about 400 electrons before 
reaching thermal equilibrium (Figure [2|l). The electron 
gas will have high temperature due to the high photon en- 
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FIG. 3: Degradation of the detected signal as a result of radiation damage, (a) Decay of Bragg peaks during exposure 
to an X-ray pulse in a urea crystal. Pulse length: 10 fs (FWHM) centered at t=0, at 1.5 A wavelength. The pulse intensity 
(1.5x10^^ photons in a focal spot of 1 /xm diameter FWHM) is such that atoms are ionized once in average when 99.5% of 
the intensity of the pulse passes through the sample. Peak intensities at different resolutions are represented by the (hkl) 
reflections and are normalized to one, based on intensities from undamaged crystals. The (330) reflection corresponds to 1.2 A 
resolution and has a pulse-integrated degradation of 50% due to ionization and atomic displacement. The dashed black line 
shows the average root mean square deviations (RMSD) in atomic positions during illumination. (b,c) Contour plots for the 
average ionization per atom (z) and the R- value as a function of the X-ray pulse length and intensity (photons/focal spot of 1 
fim diameter FWHM). The R- value (Equation [sj is calculated from all the Bragg peaks up to a resolution of 1.5 A. The red 
thick line corresponds to an R- value of 0.15, lower values are considered acceptable for a good reconstructable signal. The blue 
thick dashed line represents the damage of 1 ionization per atom. The plot in (b) shows behavior of nanocrystals smaller than 
200 nm, from which the photoelectrons escape early in an exposure, while (c) shows the behavior of crystals larger than 500 
nm, when photoelectrons are completely trapped during exposure. 



ergy, and the electrons will be distributed approximately 
uniform throughout the sample. The free electrons will 
scatter predominantly in the forward direction and con- 
tribute incoherently to the background in the diffraction 
pattern. 

At 1.5 A wavelength, the ratio between elastically scat- 
tered photons and photoionization is 1:32 for oxygen, 1:26 
for nitrogen and 1:20 for carbon. Incoming photons will 
primarily ionize sample and only a few will contribute 
to coherent scattering. The loss of an electron will de- 
crease the scattering power by 17% for a carbon atom, 
14% for nitrogen and 12% for oxygen. One ionization 
per atom also leads to atomic displacement and further 
degradation of the scattered signal (Figure [3|i). 

Since a focused X-ray pulse will destroy the sample, 
three-dimensional (3D) structure determination relies on 
the experiment being repeatable. Rather than build- 
ing up a complete X-ray diffraction data set by rotat- 
ing the crystal and collecting a sequence of diffraction 
images it will be necessary to scale together individual 
diffraction images from many different nanocrystals, in 
order to assemble a complete 3D data set [10 . A crys- 
tal with 5x5x5 unit cells will produce a discrete diffrac- 
tion pattern [11], and conventional X-ray analysis tech- 
niques may be used for indexing, merging and recon- 
struction [To]. Furthermore, oversampling techniques for 
direct phase retrieval may also be employed for a 3D 
structural determination [IT. 

We express damage-induced errors in terms of degra- 
dation of Bragg peaks, and for the entire diffraction 
pattern we calculate an R-value from simulated crys- 



tals exposed to X-ray pulses (Figure [3|. The R-value 
is a measure of the overall agreement between the crys- 
tallographic model and the experimental X-ray diffrac- 
tion data (Equation [3|. Small molecules (such as urea) 
form more ordered crystals and an R-value below 0.05 
is considered a good threshold (Cambridge Structural 
Database). For macromolecules, values up to 0.20 are 
acceptable (Protein Data Bank) and we use the conven- 
tion R< 0.15 from [11 (Figure [3|. Figure [3|i shows how 
Bragg peaks may degrade during exposure to an X-ray 
pulse, and how this influences the dependence of the R- 
value with pulse parameters. The R-value is also de- 
pendent on crystal size, and Figures [3]3,c compare two 
regimes: crystals where Auger cascades dominate versus 
crystals where damage is driven by photoelectron cas- 
cades. Trapping of photoelectrons in crystals larger than 
500 nm leads to a steeper degradation of the signal and 
constrains what pulse lengths and intensities could be 
used for successful imaging. 

When deciding which parameters of the X-ray laser 
pulse and sample characteristics one should use (Fig- 
ure [4| , there is an interplay between two effects driven 
mainly by photoelectrons; i) If photoelectrons escape the 
sample, the total number of ionizations will be signifi- 
cantly lowered. To reduce radiation damage early in the 
exposure, the sample has to be smaller than the size of 
the photoelectron cascade. The diffraction signal from 
the crystal scales with the size of the crystal as a power 
law (Equation [5|. Thus reducing sample size will con- 
versely require an increase in pulse intensity in order to 
retain the signal at the same level (Figure pi), ii) If the 
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FIG. 4: Photon signal as a function of unit cell and 
crystal size for a perfect crystal. The integrated Bragg 
peak intensity (Equation [s]) is shown for the lowest resolution 
where a full reflection can be recorded (Equation [4]). Fully 
integrated Bragg peaks can be used for indexing and averag- 
ing the signal from different nanocrystal orientations in the 
beam, and 10 detected photons are considered to give a good 
signal-to-noise ratio. The X-ray pulse has an intensity of 10^^ 
photons focused in a focal spot of 1 /xm diameter (FWHM), 
wavelength 1.5 A, beam divergence is 0.1 mrad and spectral 
bandwidth is AA/A = 0.1%. The solid lines correspond to 
a scattered signal of 1, 10 and 100 photons in the first fully 
integrated Bragg peak, when peak degradation is not taken 
into account. The signal scales with the X-ray fluence, thus 
the "1" line will correspond to 10 scattered photons for the 
case of 10^^ incident photons, and the "100" line will corre- 
spond to 10 scattered photons for the case of 10^° incident 
photons. If larger bandwidth or divergence is expected, full 
Bragg peaks can be recorded at lower resolutions and conse- 
quently the detected signal will be higher (Equation [sj . The 
inset shows details for crystals with small unit cells. 



pulse is very short, the photoelectric cascade will not have 
time to develop to reach a large number of ionizations. 
At the same time, short pulses considerably reduce the 
signal degradation due to atomic displacements and ion- 
ization (low R- value in Figure [3|. In this case the sample 
size is less important, and one can investigate any crystal 
size at photon fluences that will provide enough signal. 
The size could however be constrained by coherence re- 
quirements due to the pulse length [13]. At extremely 
short pulses, one would also need to consider the broad- 
ening of Bragg peaks [14 , i.e. a bandwidth effect. 

The above considerations stress the importance of hav- 
ing very short pulses as means for radiation damage 
control, to reduce both the ionization cascades and the 
atomic disorder. A photon flux of 10^^ photons per 
pulse and unit area {jiw?) will offer the opportunity 
to investigate a wider range of crystal sizes and unit 
cells sizes. For lower available intensities (10^^ pho- 
tons/pulse), longer pulse lengths can be accommodated 
and imaging nanocrystals of small proteins with a small 
unit cell, such as lysozyme, could be feasible. Our cal- 
culations show that to achieve an R- value of 0.15 at a 
fluence of 10^^ photons//im^ pulses have to be shorter 
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than 10 fs for crystals larger than 500 nm, where as for 
small crystals (< 200 nm) pulse lenghts can be as long 
as 30 fs. 

Ultrafast single-shot nanocrystallography fills the gap 
between single molecule imaging and crystallography. It 
offers the opportunity to investigate biological molecules 
which are too small to provide a good signal on their 
own in an X-ray laser diffraction experiment, however 
they could form nanocrystals which would be too small 
to investigate with conventional synchrotron radiation. 



Methods 

1. Electron impact ionization. Simulations 
of the ionization cascade dynamics in crystalline urea 
(CO(NH2)2) were performed using the spatial electron 
dynamics program, EHOLE, that is a part of the GRO- 
MACS [15 Molecular Dynamics software package. Urea 
was chosen as model for a biological sample for three rea- 
sons: it has a well known crystalline structure, it has an 
atomic composition of biological character, and its unit 
cell is small. In earlier work [5^, the inelastic electron 
cross sections for urea have been derived from first princi- 
ples calculations. Based on these we have calculated the 
number of secondary electrons generated by an impact 
electron in a urea crystal. The inelastic cross section for 
electron scattering in urea is comparable in magnitude 
with that for water [5 . Thus, urea crystals are a good 
model for protein nanocrystals, known to contain 30%- 
60% water. We refer to O |9l [TQ and the supplementary 
material for further details of these calculations and how 
the model compares with experiments on diamond pTj . 
Considering rrii to be the mass of electron i and the 
position of electron i with respect to the center of mass of 
all free electrons, the radius of gyration, used in Figure [2j 
is defined as 

^.<"^(^)"^ « 

2. Electron thermalization during the pulse. We 

assume that the X-ray pulse can be described by a Gaus- 
sian centered at time to = and will consider the incom- 
ing X-ray photons to be unpolarized and have a wave- 
length of 1.5 A. Following this pulse, several primary 
ionizations are treated - the photoelectric effect result- 
ing in an ejection of a high energy electron (^ 8 keV), 
accompanied by an Auger effect which provides an elec- 
tron of a lower energy, depending on atomic species. The 
emission for these electrons is described by normalized 
probability distributions: (i) the photoelectric effect is in- 
stantaneous so the emission probability follows the same 
Gaussian profile as the X-ray pulse, with the width w] 
(ii) the probability for an Auger process to be emitted is 
a convolution of a Gaussian with the exponential decay 
characteristic for each individual atomic species. The ex- 
ponential decays are taken with corresponding life times 



5 



r of 11.3 fs for carbon, 8.3 fs for nitrogen and 6.6 fs for 
oxygen. The probability for photoionization in urea is 
determined by the cross section of the atoms, which are 
weh known For the three atomic species that can un- 
dergo an Auger process, the contribution from the atoms 
C, N, and O, is weighted according to the photoioniza- 
tion cross section on the respective atoms, ac, cr^, ctq, 
and normahzed to the total photoelectric cross section 
for the urea molecule. The single electron ionization cas- 
cades develop mainly along the direction of the incident 
photon, however we consider spherical symmetry when 
these are stochastically produced. Thus, the entire ion- 
ization cascade following an X-ray pulse impinging on a 
crystal is given by 

i=C,N,0 

(t^-to)2 \ ^t-t') 
T 

(2) 

where Cphoto(^,^0 CAuger{t^t^) represent the cascade 
development with time for a single electron starting from 
time f . These are obtained from MD simulations and are 
represented by the ionization rate as a function of time 
(Figure [2|l for = 0), or radii of gyration. Figures [2]3,c. 
3. X-ray interactions and damage quantification. 
The degradation of Bragg peaks in Figure [3] has been cal- 
culated from MD simulations on an urea crystal, using 
GROMACS with a stochastic interaction of X-ray photons 
with atoms, assuming unpolarized X-rays and homoge- 
neous spatial distribution of the free electrons. The sim- 
ulation box was 10x10x10 unit cells of urea, with peri- 
odic boundary conditions, and includes thermal motion 
of atoms. The model is described in reference [11^, and 
in the supplementary material. The intensity of Bragg 
peaks at each time step is defined by integrating around 
each peak over a rectangular area centered on the Bragg 
peak and with sides of length equal to 1/10 of the sepa- 
ration between adjacent peaks [18 . The spectral width 
A A/ A, beam divergence or any broadening of the Ewald 
sphere are not taken into account here. The degradation 
of the Bragg peak is expected to be smaller when inte- 
grating through an Ewald sphere of finite thickness. To 
estimate the damage induced error we make use of the 
R- value (used in Figure |3|, calculated up to a resolution 
q from 
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where the summation is performed over all Bragg peaks 
{hkl) corresponding to scattering vectors less than q. The 
intensities of the Bragg peaks /)]^^ for the undamaged 
crystal are used as reference when compared with the 
time averaged intensities {Ihki)t of the damaged crystal 
exposed to a Gaussian-shaped X-ray pulse. The latter 



intensities take into account the ionization dynamics and 
atomic displacement as a function of time during the X- 
ray pulse. 

4. Minimum required signal. In ultra-fast single- 
shot experiments at X-ray lasers the crystals will be ex- 
posed in random orientations. X-rays beams are expected 
to have small divergence (< 1 mrad) and are highly 
monochromatic (spectral bandwidth A A/ A < 0.1%). 
Thus, single shot diffraction patterns will contain many 
partially refiected Bragg peaks. Full Bragg peaks may 
be recorded on the detector at higher resolution, and 
could be used for retrieving the original orientation of 
the nanocrystals and for averaging the signal from simi- 
lar orientation [10 (partial Bragg peaks could in princi- 
ple also be used for indexing). In our estimates for the 
minimum required signal for successful indexing, we con- 
sider only the signal from fully integrated Bragg peaks 
at the lowest resolution where these can be recorded. 
For given parameters that control the thickness of the 
Ewald sphere (beam divergence A^, spectral bandwidth 
A A/ A), the lowest resolution where a full Bragg peak can 
be recorded (g'min) is found by comparing the thickness of 
the Ewald sphere to the size of the Bragg peak (modeled 
as inversely proportional to the crystal width A) 
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The average number of photons scattered elastically by 
a protein crystal within a Bragg peak for a given resolu- 
tion ^min can be approximated by the expression for the 
integrated refiected intensity [19 



lBragg(<7min) ^ ^o(A) 



A^ 



1 + cos^ Oo 



2sin^(l9o/2) 



2 



(5) 



/ ^ / atom ( 



where both small beam divergence and polychromatic- 
ity are accounted for. Io{X) is the spectral intensity of 
the incoming X-ray beam (number of incoming photons 
per unit wavelength and unit area), integrated over the 
angular density of the incident beam. The Lorentz fac- 
tor A^/(2 sin^(^o/2)) takes into account the integration 
over the thickness of the Ewald sphere encompassing the 
Bragg peak, in a similar way as [19^ for the stationary 
case (no rotation) with divergent polychromatic radia- 
tion. The polarization factor is given by (1 + cos^ 0o)/2. 
Furthermore, re is the classical electron radius, A the 
wavelength, A the crystal side (cubic crystal), a the unit 
cell side (cubic unit cell), /atom the atomic scattering fac- 
tor, and ^0 the polar angle between the incident pulse and 
the center of the Bragg peak. 

It is assumed that the unit cell structure factor is con- 
stant within the Bragg peak, and that adjacent Bragg 
peaks do not overlap; both these approximations improve 
with the ratio A/ a. The squared structure factor of the 
unit cell is represented by its average value at high scat- 
tering angles [20 . For numerical evaluation, the unit 
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cell was assumed to have a density of 1/30 carbon- 
equivalent atoms (corresponding to a unit cell consist- 
ing of 50% non-structural water and protein with density 
approximately 1.35 g/cm^), and the scattering factor of 
carbon was calculated from the Cromer-Mann parame- 
ters j2Tj. The calculation of the number of photons per 
Bragg peak (Equation [5| presented in Figure |4] with the 
assumption of perfect crystals with no mosaicity. It has 
been shown that micron sized crystals could consist of 
only a few highly ordered domains [22], thus nanocrys- 
tals are unlikely to be organized with a mosaic spread. 

The above approximations will break down when the 
crystal size approaches the unit cell size, as the diffracted 



image turns from a discrete into a continuous pattern. 
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